Gene Characterization Index: Assessing the Depth of Gene Annotation
نویسندگان
چکیده
BACKGROUND We introduce the Gene Characterization Index, a bioinformatics method for scoring the extent to which a protein-encoding gene is functionally described. Inherently a reflection of human perception, the Gene Characterization Index is applied for assessing the characterization status of individual genes, thus serving the advancement of both genome annotation and applied genomics research by rapid and unbiased identification of groups of uncharacterized genes for diverse applications such as directed functional studies and delineation of novel drug targets. METHODOLOGY/PRINCIPAL FINDINGS The scoring procedure is based on a global survey of researchers, who assigned characterization scores from 1 (poor) to 10 (extensive) for a sample of genes based on major online resources. By evaluating the survey as training data, we developed a bioinformatics procedure to assign gene characterization scores to all genes in the human genome. We analyzed snapshots of functional genome annotation over a period of 6 years to assess temporal changes reflected by the increase of the average Gene Characterization Index. Applying the Gene Characterization Index to genes within pharmaceutically relevant classes, we confirmed known drug targets as high-scoring genes and revealed potentially interesting novel targets with low characterization indexes. Removing known drug targets and genes linked to sequence-related patent filings from the entirety of indexed genes, we identified sets of low-scoring genes particularly suited for further experimental investigation. CONCLUSIONS/SIGNIFICANCE The Gene Characterization Index is intended to serve as a tool to the scientific community and granting agencies for focusing resources and efforts on unexplored areas of the genome. The Gene Characterization Index is available from http://cisreg.ca/gci/.
منابع مشابه
Identification of Prognostic Genes in Her2-enriched Breast Cancer by Gene Co-Expression Net-work Analysis
Introduction: HER2-enriched subtype of breast cancer has a worse prognosis than luminal subtypes. Recently, the discovery of targeted therapies in other groups of breast cancer has increased patient survival. The aim of this study was to identify genes that affect the overall survival of this group of patients based on a systems biology approach. Methods: Gene expression data and clinical infor...
متن کاملClustering of a Number of Genes Affecting in Milk Production using Information Theory and Mutual Information
Information theory is a branch of mathematics. Information theory is used in genetic and bioinformatics analyses and can be used for many analyses related to the biological structures and sequences. Bio-computational grouping of genes facilitates genetic analysis, sequencing and structural-based analyses. In this study, after retrieving gene and exon DNA sequences affecting milk yield in dairy ...
متن کاملMolecular characterization of the lipL41 gene of Leptospira interrogans vaccinal serovars in Iran
Leptospirosis caused by infection with pathogenic leptospires, which is the most prevalent zoonotic disease in the world. The outer membrane proteins (OMPs) of pathogenic leptospires such as LipL41 play a crucial role in pathogenesis of this disease. Therefore a major challenge to develop an effective vaccine against leptospirosis is application of basic research on the OMPs of leptospires to i...
متن کاملCharacterization of Iranian Avian Metapneumovirus based on Fusion Gene (F)
Avian metapneumovirus (aMPV) represents one of the most prevalent diseases of poultry mainly in combination with other pathogens, and it is increasing among chickens. In the present study, the detection and characterization of an aMPV subtype B strain circulating in broiler flocks based on fusion (F) gene. In phylogenetic analysis, the isolates are located in B subtype cl...
متن کاملPURIFICATION AND CHARACTERIZATION OF THE CLONED HUMAN GM-CSF GENE EXPRESSED IN ESCHERICHIA COLI
The human granulocyte-macrophage colony stimulation factor (hGM-CSF) gene was cloned in the pET 23a( +) expression vector under the control of strong bacteriophage T7 transcription and translation signals. The hGM-CSF gene was transferred into E. coli strainBL21 (DE3)pLysS andIPTG was used for induction of GM-CSF gene. Production of the target protein was obtained as revealed by ELISA and ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- PLoS ONE
دوره 3 شماره
صفحات -
تاریخ انتشار 2008